52 research outputs found

    Semi-supervised multiscale dual-encoding method for faulty traffic data detection

    Full text link
    Inspired by the recent success of deep learning in multiscale information encoding, we introduce a variational autoencoder (VAE) based semi-supervised method for detection of faulty traffic data, which is cast as a classification problem. Continuous wavelet transform (CWT) is applied to the time series of traffic volume data to obtain rich features embodied in time-frequency representation, followed by a twin of VAE models to separately encode normal data and faulty data. The resulting multiscale dual encodings are concatenated and fed to an attention-based classifier, consisting of a self-attention module and a multilayer perceptron. For comparison, the proposed architecture is evaluated against five different encoding schemes, including (1) VAE with only normal data encoding, (2) VAE with only faulty data encoding, (3) VAE with both normal and faulty data encodings, but without attention module in the classifier, (4) siamese encoding, and (5) cross-vision transformer (CViT) encoding. The first four encoding schemes adopted the same convolutional neural network (CNN) architecture while the fifth encoding scheme follows the transformer architecture of CViT. Our experiments show that the proposed architecture with the dual encoding scheme, coupled with attention module, outperforms other encoding schemes and results in classification accuracy of 96.4%, precision of 95.5%, and recall of 97.7%.Comment: 16 pages, 8 figure

    Co-supervised learning paradigm with conditional generative adversarial networks for sample-efficient classification

    Full text link
    Classification using supervised learning requires annotating a large amount of classes-balanced data for model training and testing. This has practically limited the scope of applications with supervised learning, in particular deep learning. To address the issues associated with limited and imbalanced data, this paper introduces a sample-efficient co-supervised learning paradigm (SEC-CGAN), in which a conditional generative adversarial network (CGAN) is trained alongside the classifier and supplements semantics-conditioned, confidence-aware synthesized examples to the annotated data during the training process. In this setting, the CGAN not only serves as a co-supervisor but also provides complementary quality examples to aid the classifier training in an end-to-end fashion. Experiments demonstrate that the proposed SEC-CGAN outperforms the external classifier GAN (EC-GAN) and a baseline ResNet-18 classifier. For the comparison, all classifiers in above methods adopt the ResNet-18 architecture as the backbone. Particularly, for the Street View House Numbers dataset, using the 5% of training data, a test accuracy of 90.26% is achieved by SEC-CGAN as opposed to 88.59% by EC-GAN and 87.17% by the baseline classifier; for the highway image dataset, using the 10% of training data, a test accuracy of 98.27% is achieved by SEC-CGAN, compared to 97.84% by EC-GAN and 95.52% by the baseline classifier.Comment: 14 pages, 5 figure

    Monitoring and Assessing Traffic Safety at Signalized Intersections Using Live Video Images

    Get PDF
    0013527Signalized intersections represent the most hazard spots on a roadway network. Road users are required to be alert and timely process and respond to a variety of information at signalized intersections, including traffic signal indications and changes, signage, pavement marking, road conditions, and a mix of various road users in conflict. Traditional road safety diagnosis has been conducted in a reactive manner based on crashes that had occurred. However, to effectively reduce and eventually eliminate crashes, proactive approaches are needed. Following this direction, traffic conflict events have been collected more frequently and used as a surrogate safety measure for traffic crashes. The goal of Vision Zero would only be possible if the inconsequential event data, such as traffic conflicts, can be objectively and systematically collected and effectively utilized to diagnose and improve road safety such that consequential crash events can be prevented. In this study, the art of deep learning, multiple objects detection and tracking were explored and tested in the domain of traffic conflict monitoring and assessing. As a result, an artificial intelligence (AI) enhanced computational system was developed to automate the detection and quantification of traffic conflict events as they occur in real time using traffic monitoring cameras currently installed by transportation agencies

    Hepatitis D double reflex testing of all hepatitis B carriers in low-HBV- and high-HBV/HDV-prevalence countries

    Get PDF
    Hepatitis D virus (HDV) infection occurs as a coinfection with hepatitis B and increases the risk of hepatocellular carcinoma, decompensated cirrhosis, and mortality compared to hepatitis B virus (HBV) monoinfection. Reliable estimates of the prevalence of HDV infection and disease burden are essential to formulate strategies to find coinfected individuals more effectively and efficiently. The global prevalence of HBV infections was estimated to be 262,240,000 in 2021. Only 1,994,000 of the HBV infections were newly diagnosed in 2021, with more than half of the new diagnoses made in China. Our initial estimates indicated a much lower prevalence of HDV antibody (anti-HDV) and HDV RNA positivity than previously reported in published studies. Accurate estimates of HDV prevalence are needed. The most effective method to generate estimates of the prevalence of anti-HDV and HDV RNA positivity and to find undiagnosed individuals at the national level is to implement double reflex testing. This requires anti-HDV testing of all hepatitis B surface antigen-positive individuals and HDV RNA testing of all anti-HDV-positive individuals. This strategy is manageable for healthcare systems since the number of newly diagnosed HBV cases is low. At the global level, a comprehensive HDV screening strategy would require only 1,994,000 HDV antibody tests and less than 89,000 HDV PCR tests. Double reflex testing is the preferred strategy in countries with a low prevalence of HBV and those with a high prevalence of both HBV and HDV. For example, in the European Union and North America only 35,000 and 22,000 cases, respectively, will require anti-HDV testing annually

    A machine learning model pipeline for detecting wet pavement condition from live scenes of traffic cameras

    No full text
    Highway safety is largely influenced by weather conditions that have become increasingly volatile due to the climate change. It well known that wet pavement significantly reduces surface friction, leading to inflated collision risk. Thus, timely knowledge of the road surface condition is critical for safe driving. In this paper, a novel machine learning model pipeline is proposed to detect the wetness of pavement based on live images of highway scenes captured by publicly accessible traffic cameras. To simplify the learning task, we finetuned the state-of-the-art instance segmentation baseline models to extract background instance targets, including pavement, sky, and vegetation, which are common in highway scenes. Then, the color mixture attributes in HSV (hue, saturation and value) of each segmented instance were extracted and used as visual cues for inferring pavement condition. Finally, gradient boosting ensemble classifiers are constructed and trained using the HSV features to predict the wetness of pavement. For the segmentation task, we leveraged Detectron2 baseline models (Mask R-CNN) and evaluated three backbone networks: R50-FPN, R101-FPN, and X101-FPN. For the classification task, two most popular gradient boosting algorithms (XGBoost and CatBoost) were evaluated together with a classic logistic model. Based on experiments with our custom dataset, the best performance (F1 score: 0.927, AUC: 0.975) was achieved by the R101-FPN backbone coupled with the CatBoost classifier

    Understanding Multi-Vehicle Collision Patterns on Freeways—A Machine Learning Approach

    No full text
    Generating meaningful inferences from crash data is vital to improving highway safety. Classic statistical methods are fundamental to crash data analysis and often regarded for their interpretability. However, given the complexity of crash mechanisms and associated heterogeneity, classic statistical methods, which lack versatility, might not be sufficient for granular crash analysis because of the high dimensional features involved in crash-related data. In contrast, machine learning approaches, which are more flexible in structure and capable of harnessing richer data sources available today, emerges as a suitable alternative. With the aid of new methods for model interpretation, the complex machine learning models, previously considered enigmatic, can be properly interpreted. In this study, two modern machine learning techniques, Linear Discriminate Analysis and eXtreme Gradient Boosting, were explored to classify three major types of multi-vehicle crashes (i.e., rear-end, same-direction sideswipe, and angle) occurred on Interstate 285 in Georgia. The study demonstrated the utility and versatility of modern machine learning methods in the context of crash analysis, particularly in understanding the potential features underlying different crash patterns on freeways

    Enhancing Reliability Analysis with Multisource Data: Mitigating Adverse Selection Problems in Bridge Monitoring and Management

    No full text
    Data collected using sensors plays an essential role in active bridge health monitoring. When analyzing a large number of bridges in the U.S., the National Bridge Inventory data as been widely used. Yet, the database does not provide information about live loads, one of the most indeterminate variables for monitoring bridges. Such asymmetric information can lead to an adverse selection problem in making maintenance, rehabilitation, and repair decisions. This study proposes a data-driven reliability analysis to assess probabilities of bridge failure by synthesizing NBI data and Weigh-In-Motion (WIM) data for a large number of bridges in Georgia. On the resistance side, tree ensemble methods are employed to support the hypothesis that the NBI operating load rating represents the distribution of bridge resistance capacities which change over time. On the loading side, the live load distribution is derived from field data collected using WIM sensors. Our results show that the proposed WIM data-enabled reliability analysis substantially enhances information symmetry and provides a reliability index that supports monitoring of bridge conditions, depending on live loads and load-carrying capacities

    Deep-Learning-Based Temporal Prediction for Mitigating Dynamic Inconsistency in Vehicular Live Loads on Roads and Bridges

    No full text
    Weigh-In-Motion (WIM) data have been collected by state departments of transportation (DOT) in the U.S. and are anticipated to grow as state DOTs expand the number of WIM sites in order to better manage transportation infrastructure and enhance mobility. Traditional approaches for monitoring the vehicle weight measured in WIM systems include conducting statistical tests between two datasets obtained from two calibration visits. Depending on the frequency of visits, these traditional approaches are ineffective or resource-demanding for identifying calibration needs. Excessive vehicle-weight drifts exceeding 10% are usually indicative of poor performance by WIM systems. However, it has been difficult to consistently monitor such performance due to the sheer amount of data. In Georgia, the number of WIM sites have expanded from 12 to 29 in the past 3 years. This paper proposes a deep-learning-based temporal prediction approach for modeling sequential data and monitoring the time-history of the live loads imposed on roads and bridges. In total, 29 WIM sites in Georgia are analyzed to examine the effectiveness of a proposed temporal prediction approach for evaluating observed live loads. This study finds that the Jensen–Shannon divergence method is more effective than statistical difference tests, particularly when screening for live load anomalies. It is concluded that a LSTM neural network is able to capture temporal dynamics underlying the sequential load patterns observed in the WIM data and serves as an effective model for consistently monitoring the performance of WIM systems over time
    • …
    corecore